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Abstract. We address the problem of estimating pure qubit states with non-ideal 
(noisy) measurements in the multiple-copy scenario, where the data consists of a 
number N of identically prepared qubits. We show that the average fidelity of the 
estimates can increase significantly if the estimation protocol allows for inconclusive 
answers, or abstentions. We present the optimal such protocol and compute its fidelity 
for a given probability of abstention. The improvement over standard estimation, 
without abstention, can be viewed as an effective noise reduction. These and other 
results are exemplified for small values of N. For asymptotically large N, we derive 
analytical expressions of the fidelity and the probability of abstention, and show that 
for a fixed fidelity gain the latter decreases with N at an exponential rate given 
by a Kulback-Leibler (relative) entropy. As a byproduct, we obtain an asymptotic 
expression in terms of this very entropy of the probability that a system of N qubits, 
all prepared in the same state, has a given total angular momentum. We also discuss an 
extreme situation where noise increases with N and where estimation with abstention 
provides a most significant improvement as compared to the standard approach. 



PACS numbers: 03.67-a, 03.65.Ta, 42.50.-p 



Beating noise with abstention in state estimation 



2 



1. Introduction 

Knowing the state of a system is a key task in quantum information processing. An 
unknown quantum state can only be unveiled by means of measurements. These, 
however, provide only partial knowledge about the system and, furthermore, this 
information gain comes always at the expense of destroying the state. Only when a 
reasonably large number N of identically prepared copies of the system is available an 
accurate estimation of the state is possible. For a given N, the aim is then to find the 
measurement protocol that yields the best estimate of the input state. 

The standard estimation optimization problem is suited for a situation where, say, 
an experimentalist is confronted with an unknown state of a system of which she is asked 
to provide an estimate, based of course on the results of a measurement of her choice. 
A quantitative assessment of her performance is usually given by the expected value 
of the fidelity (or some other distinguishability measure) between the unknown input 
and her guess (see below). Hence, it is implicitly assumed that the experimentalist is 
obliged to provide such guess regardless of the measurement outcome she obtains. In 
this context, many results have been obtained over the last years in a large variety of 
scenarios [H[2l[3llll[5l[6l[Tl[8l|9l HD1 HH [12l H3] . 

Here, we will study a variation of this setting suited for a situation where the 
experimentalist is allowed to decide whether to provide a guess or abstain from doing so. 
Of course, this decision cannot be based on the actual state of the system (which is 
unknown by definition) but rather on the result of a measurement. This relaxation 
of the original setting is very useful because it enables the experimentalist to post- 
select her measurement outcomes in order to provide a more accurate guess. That is, 
the possibility of abstaining enables her to discard instances where the measurement 
outcome turns out not to be informative enough. We will find that abstention can 
provide an important advantage, specially in noisy scenarios. The problem of 'state 
estimation with abstention' [|]is specially relevant in situations where the experimentalist 
can afford to re-run the experiment, i.e. she can easily prepare a new instance of the 
problem, or where she prioritizes having high quality estimates. 

Post-selection is a widely used tool in quantum information, particularly in 
experimental scenarios, where one has special demands or constrains. A form of 
abstention has been already explored in state discrimination [TJ], another important 
quantum statistical inference primitive. Discrimination aims at identifying in which, 
out of a set of known quantum states, a system has been prepared. Two 
fundamental approaches are usually considered: the so-called 'minimum-error', where 
the experimentalist always has to provide a conclusive answer, at the expense, of course, 
of being wrong with certain probability |15j . and 'unambiguous discrimination', where no 
errors are permitted, but instead an inconclusive answer (abstention) may be given with 
some probability [16] . By varying the allowed rate of inconclusive answers [T71 [HI [191 ED] , 
we may go from one approach to the other f2T\ |2"2"| 125] . The possibility of abstaining 



| We thank G. Chiribella for suggesting this word. 
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has been studied in [21] for phase estimation, and in [25] for direction estimation with 
arbitrary pure input signals. In both cases the results show a significant improvement 
over the standard (without abstention) approach. 

In this work we consider the optimal estimation of a completely unknown pure 
qubit state when N copies of it are available for measurement and when certain rate 
(probability and rate will be used interchangeably throughout the paper) of abstention Q 
is permitted. We will show that in an ideal noise-free scenario, abstention does not 
improve the estimation accuracy. However, it does in a realistic noisy scenario, as we 
claimed above. Here we will consider a simplified model where noisy measurements will 
be replaced by local depolarizing channels followed by ideal measurements. 

The paper is organised as follows. In the next section we consider estimation 
without abstention. More precisely, we obtain the protocol that gives the best estimate 
of the state of a qubit based upon non-ideal measurements on N independent and 
identically prepared systems. In Section |3j estimation with abstention is introduced, 
and the optimal protocol for a fixed value of the abstention rate Q is obtained. We 
study the asymptotic regime of large N and derive the corresponding maximum fidelity 
and probability of abstention. As an example, we also consider an scenario where 
abstention gives a drastic improvement. This is the case when noise increases with N 
in such a way that the fidelity of the estimation approaches a finite value less than one 
as N becomes large. We close the paper with some brief conclusions and present an 
outlook for future work. 

2. No abstention 

Let us consider N copies of a completely unknown pure qubit state |n) (throughout 
the paper n will denote a unit Bloch vector) that we wish to estimate by performing 
a realistic, and therefore noisy, quantum measurement. We model it as an ideal 
measurement preceded by the single-qubit depolarizing channel acting on every copy: 

77 

£{p) = (1 - rj)p + -(a x pa x + a y po-y + a z pa z ), (1) 

where with probability 1 — 77 no error occurs, while with probability 77 the state is 
affected by either a bit-flip, a phase-flip, or both. This error probability 77 is assumed to 
be known by the experimentalist, therefore, for the purpose of analyzing the effects of 
noise in the estimation process, we will transfer its effect to the input states and optimize 
the estimation protocol over ideal measurements. Hence, we will consider input states 
of the form 

1 1+rn-a 

p(n) = r \n){n\ + (1 -r)- = , (2) 

with r = 1 — (4/3)7/. In words, we will assume that the input states either 
do not change with probability r or they become completely randomized with 
probability 1 — r = (4/3)7/. The original problem is thus equivalent to the estimation of 
a pure state \n) (or of a uniformly distributed Boch vector n) based upon the outcomes 
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of an appropriate ideal measurement on iV copies of the mixed state p(n) in Eq. ([2]), 
i.e., on the state p(n)® N = r(n). 

For each measurement outcome x an estimate \n x ) is provided according to some 
guessing rule x ~~ >" !%)• We choose to quantify the quality of the estimate by means of 
the squared overlap 

f(n,n x ) = \(n\n x )\ 2 , (3) 

also known as the fidelity. The overall quality of the estimation protocol is then given 
by the average fidelity 

F = Yl / dn f^^x)P(x\n), (4) 
x J 

where dn = sin 9d6 d<f)/(4ir) is the uniform probability distribution on the two-sphere 
and p(x\n) is the conditional probability of obtaining the outcome x if the input state 
is r{n). This probability is given by the Born rule p{x\n) = tr[n x r(n)], where II x > 
are the elements of a Positive Operator Valued Measure (POVM). They satisfy the 
completeness relation J2 X H x = 1, where 1 denotes the identity operator in the space 
spanned by the input states {r(n)}. The index x mav be discrete, continuous or both. 

A protocol (i.e., a measurement {II X } and a guessing rule x ~ > \™x)) ^ s sa ^ be 
optimal if it maximizes F. For pure states, r = 1, the maximum fidelity is well-known [3]: 

It is also known that the (continuous) covariant POVM 

n(s) = (2J+l)U(s)\J J}(J J\U j (s) (6) 

(with the obvious guessing rule H(s) — > \s)) is optimal. In ([6]), we use the 
standard notation, where {\jfn)y m= _j is the eigenbasis of the total angular momentum 
operators J 2 and J z . We denote by U(s) = {u(s)}® N , u(s) E SU(2), (the 
unitary representation of) the rotation that maps the unit (Bloch) vector z into s 
[thus = \sj\i and we have also introduced the definition J = N/2. Note that the 

POVM {Il(s)} acts on the symmetric subspace of largest total angular momentum J, 
of dimention 2 J + 1 = N + 1. In terms of J, ^ can also be written as 

MO^H* 1 +*•'>• < 7 > 

Mixed states span a much larger Hilbert space and the computation becomes more 
involved. It greatly simplifies in the total angular momentum basis, where the input 
state r(n) is block-diagonal We have 

T ( H ) = $^Pi« r i«(^)' ( 8 ) 

i=jmin 01=1 

where Tj a (n) is the normalized mixed state 

1 j 

run) = — J2 Rm u ^ \i m ' a ) 0' m ; a \ f/t ^)> ( 9 ) 



J m=—j 
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with the definitions: 

m=-j 

The additional index a, where a = 1,2, . . . ,rij, labels the various occurrences of the 
irreducible representation of total angular momentum j. The multiplicity rij is given 

by mm 



2J \ _ / 2J 

^-j ; \ j-j-i 

2J ) T 2 i + 1 . (ii) 

In the sum (|9f, j runs from j min = (j m in = 1/2) for N even (odd) to the maximum 
total angular momentum J, in contrast to the pure state case where only the maximum 
value J appears. The numbers pj a > are the probabilities that the state r(n) has 
quantum numbers j and a, i.e., pj a = tr[l JQ r(n)], where Ij a = J2"L=-j b m i a; )(j m ! a \ * s 
the projector onto the corresponding eigenspace. The projector onto the whole subspace 
of total angular momentum j is then 

rij 

h = ha- (12) 

Since the input state is permutation invariant (under the interchange of the individual 
qubits) representations with the same j are just mere repetitions of the same 
representation, they contribute a multiplicative factor of rij to the fidelity through the 
marginal probability pj = J2 a Pj a , which reads 

l-r 2 \ J ( 2J \ 2j + l R j+1 -R~ j 

(16) 



4 J \ J-j I J+j+1 R-l 

One can easily check that YljPj = 1> as ^ should be. 

Because of the block diagonal form of the input states, an obvious optimal 
measurement consists of a direct sum of covariant POVMs, 

J rij 

n(s)= ©IMS), (14) 

3= Jmin OC=l 

where each of them is a straightforward generalization of Eq.([6]): 

U ja (s) = (2j + 1) f/(s) | j j; a)(j j; a| U\s). (15) 

One can easily check that the completeness condition f dsU(s) = t holds. The total 
fidelity then is 



\ ( 1 + E pa ) 

V 3= Jmin / 



(16) 
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where pQ 

A {jAl = (17) 

with Tj(z) being any one of the normalized states defined in Eq. ([9]), (say, the 
one with a = 1). A straightforward calculation gives 

w^JtE™*-^-^*!^- < 18) 

J m=—j 

Notice that for pure states, one has R — > oo, and in turn (J 2 )j — )■ J, in agreement 
with Eq. 0. 

As will be shown in the next section, for asymptotically large N the probability pj 

peaks at a value of j ~ rJ, which gives the dominant and subdominant contributions to 

the sum in (16). Up to order 1/N, and discarding exponentially vanishing contributions 

[e.g., ~ R~ rJ ], the asymptotic fidelity turns out to be 

1 r + 1 , . 

F = l-— + ■■• . 19 

Nr 2r K J 

This result is interesting on its own and, to the best of our knowledge, has not been 
presented before. Note that for pure states (r = 1) Eq. (19) agrees with the asymptotic 
expression of the fidelity in Eq. 



3. Abstention 



In this section we focus on estimation protocols where the experimentalist is allowed 
not to produce an answer, or abstain, if the outcome of the measurement she performed 
cannot provide a good enough estimate of the unknown state. Obviously, F cannot 
decrease by excluding these abstentions from the average. In noisy scenarios, such as 
that considered in this paper, F actually increases, as will be shown below. Our aim is 
to quantify this gain and find the optimal protocol. In our approach, the probability of 
abstention, Q, is kept fixed, rather than unrestricted, since usually in practical situations 
one cannot afford discarding an unlimited amount of resources/state preparations. 



3.1. General framework 

To enable the possibility of abstaining, the POVM representing the measurement 
must include the abstention operator, which we denote by n , in addition to the 
operators {n x }, each of them associated to a specific estimate |n x ). Thus, the 
completeness relation reads 

^n x + n = i. (20) 

X 

The probability of abstention (abstention rate) and that of producing an estimate 
(acceptance rate) are then given respectively by 

Q = J dn tr [n r(n)] and Q = 1 - Q, (21) 
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and the mean fidelity denned in Q becomes now 

F = g E / rfn M tr [n x r(n)] , (22) 

X 

where notice that the sum does not include the Ho operator and Q takes into account 
the abstentions excluded from the average. 

We next note that for any unitary transformation U of the type defined after 
Eq. (pL the operators {UH X W, UUqW} give the same value of Q and F(Q) as the 
original set {IT X , Il }, provided we change the guessing rule as n x —> %un x , where "Ru is 
the SO (3) rotation whose unitary representation is U. Therefore, one can easily prove 
that n (the set {IT X }) can always be chosen to be SU(2) invariant (covariant) by simply 
averaging over U. In other words, with no loss of generality the POVM elements that 
provide a guess \s) can be chosen as 

fl(s) = U(s)UU^(s), (23) 

where IT > is the so called seed of the POVM (in particular, note that 11(5) = IT). 
The abstention operator then reads 

n = 1- J dsfl(s), (24) 

which is manifestly rotationally invariant (as claimed above). It is thus proportional to 
the identity on each invariant subspace 

J *»j J 
n o= ©«A- Mi, (25) 

j = Jmin a 1 j = jmin 

where a,- are coefficients that satisfy the condition < a,- < 1 and 1,- is defined 



in Eq. (12). Here we have used the permutation invariance of the input state to fix, 
without loss of generality, for all a. We can also choose IT (s) to have the 

block-diagonal form of the input state r(n), namely, 

fi (*)= ©n^a(s). (26) 

j=jmin 01=1 

For given {%}, the optimality of Hj a (s), defined in Eq. (15), clearly ensures that 



fl ja (s) = (1 - aj)U ja {^, (27) 
are also optimal for estimation with abstention. Recalling that the label a is 



unsubstantial, aside from the multiplicative factor rij, we have from (21) that the 
abstention probability is simply 

j 

J = 3 min 



where pj is given in Eq. (13). The coefficients aj can be understood as the probabilities 



of abstention conditioned to the input state having total angular momentum j, 
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i.e., dj 



p(abstention|j). Similarly, for a given j, the probability of producing an 



estimate, or accepting, is a 7 - 



1 — a~ 



p(acceptance|j). 



From Eq. (22) we obtain 



HQ) 



3 — Jmin 



where 



a 



1 



Q 



Q " 



(29) 



(30) 



and the quantity Aj is given in Eqs. (17) and (18). Thus, we are only left with the 
free parameters a,j, which have to be optimized in order to maximize F(Q), subject to 
the constraints < a,j < 1 and (28). Somehow expected, one can show that Aj is a 
monotonically increasing function of j, i.e., Aj_i < Aj, therefore the largest contribution 
to the fidelity is given by Aj. This corresponds to aj = 1 and a, = 0, j < J. Hence, for 
unrestricted probability of abstention, the optimal protocol discards any contribution 
with j < J. This protocol, however, would provide an estimate with a probability that 
decreases exponentially with N, for r < 1, as pj ~ (l/r)[(l + r)/2] N+1 . Notice that in 
a noiseless scenario, r = 1, there is only the contribution j = J, which is already the 
optimal one and therefore abstention is of no use in such case. 

Clearly, for finite Q there can be contributions from other total angular momentum 
eigenspaces (j < J) compatible with Eq. (28). Recalling the monotonicity of Aj, and 
by convexity, it is obvious from Eqs. (29) and (30) that there must exist an angular 
momentum threshold j* such that aj = (a,j = 1), if j < j* (j > j*). The value j* is 
determined through Eq. (28) to be 



f 

Thus, we have 



max jj such that Q — Yuj'^j^Pj 1 — o} 



(31) 



1, 

Pj 
0. 



Q - ££ 



-1 

-Jmin 



Pf 



3 <3 
3 = 3" 
3 > 3" 



(32) 



In a more physical language, the optimal strategy consists actually of two successive 
measurements. The experimentalist first measures the total angular momentum j of 
the input state r(n) and decides to abstain (provide a guess) if j < j* (j > j*). 
If j = j*, she simply decides randomly, by tossing a Bernoulli coin with probability aj* 
of coming up heads, and if heads (tails) show up, abstain (provide a guess). In order 
to provide the actual guess, if she decides to do so, she performs the optimal POVM 
measurement {n(s)} [or just {© a rL^s)}] in Eq. (14) on the state ® a Tj a (n) that 
resulted from the first measurement. 
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3.2. Small number of copies 

In Fig. [I] we plot the fidelity gain due to abstention [F(Q) - F(0)]/F(0) = AF/F(0) 
vs. Q for iV = 6, N = 8, and purities of r = 0.3 and r = 0.7. The structure of Eq. (32) 
is apparent from these plots: at Q = (a, = for all j) there is, naturally, no gain; 
kinks sequentially appear at the precise values of Q where a new coefficient cij in (32) 
becomes positive (and j* increases by one); the curves are convex between successive 
kinks, where the one aj that has become positive, a,-*, keeps increasing. This pattern 
repeats until the abstention rate Q reaches a critical value Q cr it at which j* = J, 

, /1 . \ 2J+1 /-, x 2J+1 

1 / 1 + r \ 1/1 



Q CTit = l- P j = !--[ — ) + z [-z-) (33) 



[see Eq. (13)]. Increasing Q further will not provide any additional gain, as the flat 
plateaus of Fig. [T] illustrate. This is so, since one can view the optimal abstention 
protocol as a filtering process where the low angular momentum components of the 
input state are filtered out. Hence, keeping the maximum value of j = J is the optimal 
filtering beyond which no further improvement is possible. Fig. [l](a) shows that in noisy 
scenearios, e.g. r = 0.3, abstention can increase the fidelity quite notably, up to 15%. 
For higher purities the gain is more moderate, as shown in Fig. [T](b) . The enhancement 
in this case is about 4-5% but with an abstention rate slightly above 50%. Further 
results are shown in Fig. |2j where we plot the fidelity gain as a function of the number 
of copies N for abstention rates larger than Q CI it, and for various values of the purity r. 
All the curves have a maximum at a value of iV that varies with the purity. The lower 
the purity, the higher the value of iV at which the maximum occurs (e.g., for r = 0.3 
the maximum gain occurs at N = 12; for r = 0.1 the maximum is off scale at the right 
of the figure). 

As we have seen, the possibility of abstaining enables us to reach values of the 
fidelity that otherwise we could only attain with lower levels of noise. To quantify this 
effective reduction of noise, let us define an effective purity r e g by the implicit equation 
F(r eff , N, 0) = F(r, N, Q). That is, for an estimation setting, given by r, iV,and Q, r e g is 
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the purity of the input states that would provide the same fidelity if the standard strategy 
without abstention (Q = 0) were used instead. Since r is related to the probability of 
error rj in our model of noisy measurements in ([lj, an increase of the effective purity 
corresponds to an effective reduction of the amount of noise in the measurement through 
the relation r] eff = (3/4)(l — r eS ). Figure [3] shows a plot of the effective purity r eff as a 
function of Q for various values of r and N. As can be seen, r e e increases faster at low 
values of N, but it saturates earlier (lower Q cr i t ), reaching a lower value. For low N and 
for a wide range of purities, 0.1 < r < 0.9, we observe a constant effective increase of the 
purity, r e fj rj r + 0.2, for reasonable values of the abstention rate Q. As N increases one 
has to go to higher values of the abstention rate, Q ~ Q C rit, to have a significant gain. 
Hence, a moderate abstention rate is most effective in noisy scenarios when a small, but 
fair, number of copies is available. 

Finally, let us point out that the protocol we have presented requires a projection 
on the total angular momentum eigenspaces. This is a non-local measurement that 
nonetheless can be implemented efficiently [27] . In a more extreme scenario where there 
are no restriction on the abstention rate, one can attain the maximum fidelity with an 
even simpler strategy: perform a local Stern-Gerlach measurement on every qubit (say, 
of the ^-component of the spin) and abstain unless all outcomes agree. This strategy 
renders an abstention probability of Q = 1 — [(1 + r)/2] N , which might be comparable 
to Qcrit in Eq. ([331. 



3.3. Asymptotic regime 

We next compute the analytical expressions of the fidelity in the large N limit. Here it 
is useful to define the variable 

x = - < x < 1, (34) 




N 



Figure 2. Fidelity gain as a function of N for various values of r, indicated in the 
legend, and for Q > Q C rit- 
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1 r 




0.2 0.4 0.6 0.8 1 

Q 

Figure 3. Effective purity r e g as a function of the abstention rate Q for N = 5 
(dotted), 10 (dashed), and 30 (solid), and for purities of r = 0.01, 0.1, 0.3, 0.5, 0.7, 
and 0.9, which can be read off from the values of r e g at Q = 0. 




0.0 0.2 0.4 0.6 0.8 1.0 

X 



Figure 4. Plots of A(x) (blue line with solid circles) and p(x) (red line with empty 
circles) for N — 100 and r = 0.5. The circles represent the quantities Aj and Jpj as a 
function of x = j / J ■ The shaded area indicates the acceptance region for an abstention 
rate Q ~ 93%. 
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which becomes continuous in the limit iV — > oo ( J — > oo). In this case, we can replace pj 
by the continuous probability distribution in [0, 1] defined by 

p{x) = Jp j=xJ , (35) 

so that dxp(s 



1 as N goes to infinity. Eq. (29) can then be approximated by its 



continuous version, which reads 



1 + 



dx p(x)A(x) 



where 



A(x) = A 



j=xj ) 



(36) 



(37) 



where recall that A,- is given in Eqs. (30). From Eq. (32) we see that asymptotically a~ 



becomes the step function 9{x — x*), where x* 
definition 



9{x) 



x > 0; 
x < 0. 



j*/J, and we have used the standard 



(3* 



With this, Eq. (30) becomes 

9(x- 



Ax) 



x 



Q 



-A(x) 



and, in turn, 



1 + 



dxp(x)A(x) 



X* 



It also follows from (28) that 

Q— dxp(x)9(x — x* 



dxp(x). 



(39) 



(40) 



(41) 



At this point, we need to find a good approximation to p(x) that would enable us 
to obtain the explicit form for the asymptotic fidelity. From Eq. (13), and using the 
Stirling formula, we obtain 

.V 1 x{l + r) 




P{X) ~ V 27r v /T^2r(l + x) 
where H(s \\ t) is the (binary) relative entropy 



-iV\ff(i±5||i±t) 



H(s || t) = s log ■ 



log 



1 - s 



(42) 



(43) 



t v ' 

and the approximation is valid for both x and r in the open unit interval (0, 1). The 
appearance of a relative entropy in Eq.(42) can be understood as follows. Our iV-copy 
input state (diagonal in the canonical J n basis) can be thought of as a classical coin 
tossing distribution of iV identical coins with a bias of (1 + r)/2. From the theory of 
types j28] it is well known that the probability to get k heads is given by the Kulback- 
Leibler distance (or relative entropy) between the empirical distribution {/ = k/N, 1— /} 
and the distribution {(1 + r)/2, (1 -r)/2}. That is, p(k) ~ exp{-NH[f || (l + r)/2]}to 
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first order in the exponent. The number of heads k is in one-to-one correspondence with 
the magnetic quantum number, m = k — J, and the conditioned probability p(j\m) is 
strongly peaked at m = j, as one can easily check. It follows that the probability that 
the input state has total angular momentum j, given by p(j) = ^ m p{j\Tn)p(m), will be 
asymptotically determined by the probability distribution p(m), which has a convenient 
expression in terms of the typical and the empirical distribution of up/down outcomes. 

From Eq. (42) it follows that p(x) is peaked at the value x = r, i.e. at j = rJ, as 
shown in Fig. [4] and stated without a proof in Sec. [2j Actually, around the peak, x ~ r, 
the exponent becomes quadratic and p{x) approaches the Gaussian distribution 



P( 



N 



x) 



-N 



2tt(1 



2(l-r' 2 ) 



(44) 



as also follows from the central limit theorem, whereas it falls off exponentially elsewhere. 

It is now apparent that, asymptotically, abstention has negligible impact if 
components with j below rJ are filtered out (x* < r), since the main contribution 
to the fidelity, which comes from the peak around x ~ r, is not excluded from the 
integral in Eq. (40) (only the left exponentially decaying tail is). For the same reason 



[see Eq. (41)], Q ~ 1 (the abstention rate Q is exponentially small), and Eq. (40) yields 
^ „ 1 r + 1 



2N 



+ . . . for x* < r, 



(45) 



which is the same expression as the asymptotic fidelity of the protocol without 



abstention, Eq. (19). 



It is then clear that, in order to have a discernible improvement in the fidelity, the 
abstention threshold x* must lie to the right of the peak of the probability distribution. 



The fidelity in (40) then can be written as 

p(x 



1 + 



Q 



-A{x* 



1 



1 + A(x* 



x > r, 



(46) 



where we have used that for x > x* > r and for large enough N, p(x) falls off 
exponentially and the integral can be approximated by the value of the integrand at its 



lower limit. By the very same argument Eq. (41) gives 



(47) 



which has also been used in (|46|). Using now (|42|) we obtain that in the asymptotic limit 

(48) 



of many copies, the rate at which our protocol provides a guess is 
Q ~ exp [-NH {^f || . 



Recalling Eqs. (17) and (18) we obtain the optimal fidelity: 

r + 1 



1 - 



2Nx* 



+ 



for r < x* < 1, 



(49) 



for a value of Q given by (48). For x* 
whereas for x* — > 1 (Q > Q cr i t 



r the results (19) and (45) are recovered 



the maximum average fidelity is attained 
1 r + 1 



2N 



+ 



(50) 
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The advantage provided by our estimation with abstention protocol can be 
quantified by the effective number of copies that the standard protocol without 
abstention would require to achieve the same fidelity: N c q = (x*/r)N, where x* G [r, 1) 



is determined by the abstention rate Q through (48). For high noise levels 
(low purity, r < 1) our protocol provides an important saving of resources/copies, 
as N e s/N = 1/r ^> 1, whereas for nearly ideal detectors the saving in this asymptotic 
regime is more modest. 

Alternatively, the advantage discussed above can also be quantified by the effective 



measurement-noise reduction, or equivalently, the effective purity r eff (See Sec. 3.2). 



Using (49) one can easily find a simple expression for the effective purity in the 
asymptotic limit and for large abstention rate: r e s = (r + y/Ar + 5r 2 )/[2(l + r)]. In 
the limit of very low noise levels the errors probability i] [recall Eq. ([I])] is effectively 
reduced by a factor of three, i.e., rj e g = rj/3, while in the opposite limit of very noisy 
measurements one finds r e g = yjr. 

3.4- Other regimes 

In the previous section we have seen how a gain in fidelity can be obtained provided 
the 'acceptance' rate Q falls off exponentially as iV becomes very large. Here we give 
an example where this gain takes place even at finite Q. 

At fixed noise level (purity r), the fidelity is an increasing function of N. However, 
one could imagine an experimental setup where the noise (purity) also increases 
(decreases) with N. If this is so, the asymptotic fidelity could be strictly less than 
one, or in other words, perfect estimation could be unattainable even with unbounded 
resources. This is the case in our example, were we assume that r = a/y/N, a being 
a positive constant. Notice that the threshold x* must also scale as 1/yN in order 
to have a reasonably low abstention rate. Therefore, it is convenient to use a new 
variable £ = \/Nx = \/N j / J = 2j/\/N instead. Then, the probability distribution in 
this new variable is 



P«) = ^W* /2 , withr = -^. (51) 
Recalling Eq. (Tl3| and using Stirling formula this equation gives 



V2 J p V ^2 

P(0 = — 7^ £ (52) 



to leading order in inverse powers of N. The subleading terms are of order iV 1//2 and 
will be neglected here. For a given threshold value £* = 2j*/\/N the abstention rate is 



r 2 



Q= I P(0 <% = \ (erf + erf C) - 6 ~ " ' , (53) 



o 



where £± = (£* ± a)/ \/2 and erf x is the error function. 



Beating noise with abstention in state estimation 



15 




Q 



Figure 5. Plot of the fidelity as a function of Q for r — a/yN, with the choice a = 1.0, 
N = 10 6 (red circles) . The solid line (in blue) is the leading term in Eq. ( |57| plotted as 
a function of Q [a parametric plot of the pairs (Q,F), as given by Eqs. (|53[) and (57l] 



From Eqs. ([171) and (18) we have in this same regime and at leading order 

2 1 

~ 1 _ e 2< ~ ^* 



A(0 = A 



With the above, the fidelity (29), [or rather, the counterpart of p6J)] is 



1 + 



dfp(0A(0 



1 + 



dep(OA(o 



where the last integral can be computed to be 



A* = 



a(Op(o^ 



(54) 



(55) 



(56) 



I -a 2 
2a? 



(erfe-erfe;) + 



t* 2 t* 

2na 



We can finally write the fidelity as 



A* 



1-Q 



0(N- 



-l/2s 



(57) 



As shown in (53) and (56), both Q and A* are functions of the filtering threshold £*, 
which is just a properly scaled version of the original threshold j*. Finding the maximum 
fidelity for a given rate of abstention Q requires inverting Eq. (53) to obtain £,*(Q), but 
this cannot be done analytically and one has to resort to numerical methods. 

In Fig. [5] we plot F as a function of Q for a = 1. The increase of the fidelity in the 
asymptotic regime of large iV is clearly seen: e.g., an abstention rate of a 50% yields a 
rise of about 10%, and it goes up to about 30% for higher (but still reasonable) values 
of Q. The figure also shows the agreement between the approximate form of the fidelity 
given by Eqs. (53) to (57) and the numerical evaluation of its exact expression in (29). 

It should be noted that in the regime described here a rise of the input size N fails 
to replicate the fidelity improvement that results from increasing the rate of abstention 
(no iV e fT can be defined in this regime), thus abstention appears to be the only means 
by which one can improve estimation. 
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4. Conclusions 

In this work we have addressed optimal estimation of pure qubit states when abstention 
from providing an outcome is allowed. We have considered a reasonably realistic 
multiple-copy scenario, where a sample of N identically prepared systems go through 
a non-ideal (noisy) process of measurement. We have shown that in the limit of zero 
noise, abstention does not help to improve estimation (it does not hamper it either). 
However, abstention turns out to counterbalance the adverse effect of errors in a noisy 
process of measurement. We have shown that in general abstention is most useful for 
inputs of few copies and for error rates of the order of a few percent. E.g., for N = 6 
and a value of the error probability of r] = 0.5 (per qubit), one can easily attain fidelity 
gains of the order of 15% with an abstention rate of Q = 4/5. As N increases, one needs 
to allow for higher abstention rates to obtain a significant improvement. We have given 
analytical asymptotic expressions of the fidelity valid in the limit of large number of 
copies. In this limit, abstention can have the effect of increasing the number of copies 
by a constant fraction: N e s/N = x* /r (x* > r), with an acceptance rate Q given by the 
relative entropy: — (1/N) logQ = H[{1 + x*)/2 || (1 + r)/2]. For low levels of noise this 
amounts to reducing the error probability rj by a factor of up to three. 

We have also considered a scenario where the noise (per qubit) increases with the 
number of copies in such a way that perfect estimation is unattainable (limjv_ 5 . 00 F < 1). 
In this case one can obtain a significant enhancement of the asymptotic fidelity (few 
percent) even for finite abstention probabilities Q < 1. Moreover, in such scenario 
abstention appears to be the only way to improve estimation. 

In broader parameter estimation contexts, where, e.g., phase or direction 
information is encoded in more general many-particle states [10], abstention may have a 
much more dramatic effect. These issues will be analyzed in a separate publication [25J. 

Acknowledgments 

We acknowledge financial support from ERDF: European Regional Development Fund. 
This research was supported by the Spanish MICINN, through contract FIS2008-01236 
and the Generalitat de Catalunya CIRIT, contract 2009SGR-0985. We thank G. 
Chiribella for useful discussions. 

References 

[1] A.S. Holevo, Probabilistic And Statistical Aspects Of Quantum Theory, North-Holland Series In 

Statistics And Probability (North- Holland, Amsterdam 1982). 
[2] M.Paris and J. Rehacek (Eds.), Quantum State Estimation, Lect. Notes Phys. 649 (Springer, Berlin 

Heidelberg 2004). 
[3] S. Massar and S. Popescu, Phys. Rev. Lett. 74, 1259 (1995). 
[4] Z. Hradil, Phys. Rev. A 55, 1561 (1997). 

[5] R. Derka, V. Buzek and A.K. Ekert, Phys. Rev. Lett. 80, 1571 (1998). 
[6] D. Bruss and C. Macchiavello, Phys. Lett. A 253, 249 (1999). 



Beating noise with abstention in state estimation 



17 



[7] D.G. Fischer, S-H. Kienle and M. Freyberger, Phys. Rev. A 61, 032306 (2000). 
[8] E. Bagan, M. Baig, and R. Munoz-Tapia, Phys. Rev. Lett. 89, 277904 (2002). 
[9] E. Bagan, M.A. Ballester, R.D. Gill, A. Monras, and R. Munoz-Tapia, Phys. Rev. A 73, 
032301 (2006). 

[10] E. Bagan, M. Baig, A. Brey, R. Munoz-Tapia, and R. Tarrach, Phys. Rev. Lett. 85, 5230 (2000); 

Phys. Rev. A 63, 052309 (2001). 
[11] R. Blume-Kohout, New. J. Phys. 12, 043034 (2010). 

[12] R.B.A. Adamson and A.M. Steinberg, Phys. Rev. Lett. 105, 030406 (2010). 

[13] D. Gross, Y.K. Liu, S.T. Flammia, S. Becker and J. Eiscrt, Phys. Rev. Lett. 105, 150401 (2010). 
[14] J. A. Bcrgou, Journal of Modern Optics 57, 160 (2010). 

[15] C. W. Helstrom, Quantum Detection and Estimation Theory (Academic Press, New York, 1976). 
[16] I. D. Ivanovic, Phys. Lett. A 123, 257 (1987); D. Dicks, Phys. Lett. A 126, 303 (1988); A. Peres, 

Phys. Lett. A 128, 19 (1988); G. Jaeger and A. Shimony, Phys. Lett. A 197, 83 (1995). 
[17] A. Chefles and S. M. Barnctt, J. Mod. Opt. 45, 1295 (1998). 
[18] C.-W. Zhang, C.-F. Li, and G.-C. Guo, Phys. Lett. A 261, 25 (1999). 
[19] J. Fiurasek and M. Jezck, Phys. Rev. A 67, 012321 (2003). 
[20] Y. C. Eldar, Phys. Rev. A 67, 042309 (2003). 

[21] A. Hayashi, T. Hashimoto, and M. Horibc, Phys. Rev. A 78, 012333 (2008). 
[22] H. Sugimoto, T. Hashimoto, M. Horibe, and A. Hayashi, Phys. Rev. A 80, 052322 (2009). 
[23] E. Bagan, R. Munoz-Tapia, G.A. Olivares-Renteria and J. A. Bergou, in preparation. 
[24] J. Fiurasek, New. J. Phys. 8, 192 (2006). 

[25] B. Gendra, E. Ronco-Bonvehi, J. Calsamiglia, R- Munoz-Tapia, and E. Bagan, in preparation. 
[26] J.I. Cirac, A.K. Ekcrt and C. Macchiavello, Phys. Rev. Lett. 82, 4344 (1999). 
[27] D. Bacon, I. Chuang and A. Harrow, Phys. Rev. Lett. 97, 170502 (2006). 

[28] T.M. Cover and J. A. Thomas, Elements of Information Theory, 2nd Ed., Wiley Series in 
Telecommunications and Signal Processing, (John Wiley & Sons, Inc., Hoboken, New Jersey, 
2006). 



